Deep learning models, which often outperform traditional approaches, are trained on large datasets of a given medium, e.g., images, across numerous areas. For light-field-specific machine learning tasks, however, such datasets are scarce. We therefore create our own light field datasets, which have great potential for a variety of applications due to the abundance of information light fields carry compared to single images. Using Unity and C#, we develop a novel approach for generating large, scalable, and reproducible light field datasets based on customizable hardware configurations to accelerate light field deep learning research.
Migraine is a high-prevalence and disabling neurological disorder. However, information on migraine management in real-world settings may be limited in traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by migraine sufferers; (ii) develop a platform-independent text classification system for automatically detecting self-reported migraine-related posts; and (iii) conduct analyses of the self-reported posts to assess the utility of social media for studying this problem. We manually annotated 5750 Twitter posts and 302 Reddit posts. Our system achieved an F1 score of 0.90 on Twitter and 0.93 on Reddit. Analysis of information posted by our 'migraine cohort' revealed the presence of a plethora of relevant information about migraine therapies and patient sentiments associated with them. Our study forms the foundation for conducting an in-depth analysis of migraine-related information using social media data.
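The abstract does not specify the classifier used; as a generic illustration of the task of detecting self-reported posts, a minimal bag-of-words Naive Bayes sketch (toy, hypothetical training posts) might look like:

```python
from collections import Counter
import math

class NaiveBayes:
    """Minimal multinomial Naive Bayes with add-one smoothing.

    Illustrative only; not the classifier described in the paper.
    """

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.priors = {c: labels.count(c) / len(labels) for c in self.classes}
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for c in self.classes for w in self.word_counts[c]}
        return self

    def predict(self, text):
        def log_prob(c):
            # Add-one smoothing over the shared vocabulary.
            total = sum(self.word_counts[c].values()) + len(self.vocab)
            lp = math.log(self.priors[c])
            for w in text.lower().split():
                lp += math.log((self.word_counts[c][w] + 1) / total)
            return lp
        return max(self.classes, key=log_prob)

# Hypothetical posts: 1 = self-reported migraine, 0 = other migraine chatter.
texts = ["my migraine is killing me today",
         "another migraine attack this morning",
         "new study on migraine prevalence published",
         "read this article about migraine research"]
labels = [1, 1, 0, 0]
clf = NaiveBayes().fit(texts, labels)
```

A platform-independent system of the kind described would apply the same trained model to text from either Twitter or Reddit, since nothing in the feature representation is platform-specific.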
The core of the computing business now offers subscription-based on-demand services with the help of cloud computing. Virtualization, which creates a virtual instance of a computer system running in an abstracted hardware layer, allows resources to be shared among multiple users. In contrast to early distributed computing models, cloud computing provides seemingly unlimited computing capacity through massive cloud datacenters, and it has become incredibly popular in recent years due to its continually growing infrastructure, user base, and hosted data volume. This article suggests a conceptual framework for a workload management paradigm in cloud settings that is both secure and performance-efficient. In this paradigm, a resource management unit performs energy- and performance-efficient virtual machine allocation, ensures the safe execution of users' applications, and protects against data breaches caused by unauthorised real-time virtual machine access. A secure virtual machine management unit controls the resource management unit and is designed to report unlawful access or intercommunication. Additionally, a workload analyzer unit runs in parallel to estimate resource consumption, helping the resource management unit allocate virtual machines more effectively. The proposed model combines several mechanisms to serve this objective, including data encryption and decryption prior to transfer and a trust-based access mechanism to prevent unauthorised access to virtual machines, which introduces additional computational overhead.
The kernel function and its hyperparameters are the central model selection choice in a Gaussian process (Rasmussen and Williams, 2006). Typically, the hyperparameters of the kernel are chosen by maximising the marginal likelihood, an approach known as Type-II maximum likelihood (ML-II). However, ML-II does not account for hyperparameter uncertainty, and it is well known that this can lead to severely biased estimates and an underestimation of predictive uncertainty. While there are several works which employ a fully Bayesian characterisation of GPs, relatively few propose such approaches for the sparse GP paradigm. In this work we propose an algorithm for sparse Gaussian process regression which leverages MCMC to sample from the hyperparameter posterior within the variational inducing point framework of Titsias (2009). This work is closely related to Hensman et al. (2015b) but side-steps the need to sample the inducing points, thereby significantly improving sampling efficiency in the Gaussian likelihood case. We compare this scheme against natural baselines in the literature and against stochastic variational GPs (SVGPs), together with an extensive computational analysis.
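The contrast between ML-II and a fully Bayesian treatment can be illustrated on a toy scale: instead of a single optimised lengthscale, draw posterior samples of it with random-walk Metropolis on the log marginal likelihood. The sketch below is a pure-Python, exact-GP illustration on hypothetical data, not the paper's sparse variational scheme.

```python
import math, random

def sq_exp(x1, x2, ls, var=1.0):
    """Squared-exponential kernel for scalar inputs."""
    return var * math.exp(-0.5 * (x1 - x2) ** 2 / ls ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a small PSD matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(A[i][i] - s) if i == j else (A[i][j] - s) / L[j][j]
    return L

def log_marginal(xs, ys, ls, noise=0.1):
    """Exact GP log marginal likelihood log p(y | lengthscale)."""
    n = len(xs)
    K = [[sq_exp(xs[i], xs[j], ls) + (noise ** 2 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    L = cholesky(K)
    z = []  # forward-solve L z = y, so y^T K^{-1} y = ||z||^2
    for i in range(n):
        z.append((ys[i] - sum(L[i][k] * z[k] for k in range(i))) / L[i][i])
    logdet = 2.0 * sum(math.log(L[i][i]) for i in range(n))
    return -0.5 * sum(v * v for v in z) - 0.5 * logdet - 0.5 * n * math.log(2 * math.pi)

def sample_lengthscale(xs, ys, n_samples=200, step=0.3, seed=0):
    """Random-walk Metropolis over log-lengthscale with a N(0,1) prior."""
    rng = random.Random(seed)
    log_ls = 0.0
    cur = log_marginal(xs, ys, math.exp(log_ls))
    samples = []
    for _ in range(n_samples):
        prop = log_ls + rng.gauss(0.0, step)
        lp_cur = cur - 0.5 * log_ls ** 2
        lp_prop = log_marginal(xs, ys, math.exp(prop)) - 0.5 * prop ** 2
        if math.log(rng.random()) < lp_prop - lp_cur:
            log_ls, cur = prop, lp_prop + 0.5 * prop ** 2
        samples.append(math.exp(log_ls))
    return samples

xs = [0.0, 0.5, 1.0, 1.5, 2.0]
ys = [math.sin(x) for x in xs]
samples = sample_lengthscale(xs, ys)
```

The spread of `samples` is exactly the hyperparameter uncertainty that a single ML-II point estimate discards; the paper's contribution is making this kind of sampling efficient in the sparse, inducing-point setting.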
Single-cell RNA-seq datasets are growing in size and complexity, enabling the study of cellular composition changes in various biological/clinical contexts. Scalable dimensionality reduction techniques are needed to disentangle biological variation while accounting for technical and biological confounders. In this work, we extend a popular approach for probabilistic non-linear dimensionality reduction, the Gaussian process latent variable model, to scale to massive single-cell datasets while explicitly accounting for technical and biological confounders. The key idea is to use an augmented kernel which preserves the factorisability of the lower bound, allowing fast stochastic variational inference. We demonstrate its ability to reconstruct latent signatures of innate immunity recovered in Kumasaka et al. (2021) with 9x lower training time. We further analyse a COVID dataset and demonstrate on a cohort of 130 individuals that this framework enables data integration while capturing interpretable signatures of infection. Specifically, we explore COVID severity as a latent dimension to refine patient stratification and capture disease-specific gene expression.
The Upper Indus Basin (UIB) in the Himalaya provides water for 270 million people and countless ecosystems. However, precipitation, a key component of hydrological modelling in this region, is highly uncertain. A central challenge surrounding this uncertainty stems from the complex spatio-temporal distribution of precipitation across the basin. In this work, we propose Gaussian processes with structured non-stationary kernels to model precipitation patterns in the UIB. Previous attempts to quantify or model precipitation in the Hindu Kush Karakoram Himalayan region have often been qualitative, or have included crude assumptions and simplifications which cannot be resolved at lower resolutions; this body of research also provides little to no error propagation. We account for the spatial variation in precipitation with a non-stationary Gibbs kernel parameterised by an input-dependent lengthscale. This allows the posterior function samples to adapt to the differing precipitation patterns inherent in the distinct underlying topography of the Indus region. The input-dependent lengthscale is governed by a latent Gaussian process with a stationary squared-exponential kernel, so that the function-level hyperparameters vary smoothly. In ablation experiments, we motivate each component of the proposed kernel by demonstrating its ability to model spatial covariance, temporal structure, and joint spatio-temporal reconstruction. We benchmark our model against stationary Gaussian processes and deep Gaussian processes.
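The non-stationary Gibbs kernel referred to above has a compact closed form for 1-D inputs. The sketch below computes it in pure Python; the log-linear lengthscale function `l` is a hypothetical stand-in for the latent squared-exponential GP that governs it in the paper.

```python
import math

def gibbs_kernel(x1, x2, lengthscale, variance=1.0):
    """Non-stationary Gibbs kernel for 1-D inputs.

    `lengthscale` is any positive callable l(x); in the paper it is governed
    by a latent GP with a stationary squared-exponential kernel.
    """
    l1, l2 = lengthscale(x1), lengthscale(x2)
    prefactor = math.sqrt(2.0 * l1 * l2 / (l1 ** 2 + l2 ** 2))
    return variance * prefactor * math.exp(-((x1 - x2) ** 2) / (l1 ** 2 + l2 ** 2))

# Hypothetical input-dependent lengthscale: short near x = 0 (rugged terrain,
# rapidly varying precipitation), growing with distance (smoother regimes).
l = lambda x: 0.5 + 0.3 * abs(x)

grid = [0.0, 1.0, 2.0, 3.0]
K = [[gibbs_kernel(xi, xj, l) for xj in grid] for xi in grid]
```

Setting `l` to a constant recovers the stationary squared-exponential kernel exactly, which is the sense in which the Gibbs kernel generalises it: the model pays for flexibility only where the latent lengthscale process says the terrain demands it.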
Every home is different, and every person likes things done in their particular way. Therefore, home robots of the future need to both reason about the sequential nature of day-to-day tasks and generalise to users' preferences. To this end, we propose a Transformer Task Planner (TTP) that learns high-level actions from demonstrations by leveraging object-attribute-based representations. TTP can be pre-trained on multiple preferences and shows generalisation to unseen preferences using a single demonstration as a prompt in a simulated dishwasher-loading task. Further, we demonstrate real-world dish rearrangement using TTP with a Franka Panda robotic arm, prompted by a single human demonstration.
We design short-blocklength codes for the Gaussian wiretap channel under information-theoretic security guarantees. Our approach consists in decoupling the reliability and secrecy constraints in the code design. Specifically, we handle the reliability constraint via an autoencoder and the secrecy constraint with hash functions. For blocklengths smaller than or equal to 16, we evaluate through simulations the probability of error at the legitimate receiver and the leakage at the eavesdropper for our code construction. This leakage is defined as the mutual information between the confidential message and the eavesdropper's channel observations, and is empirically measured via a neural-network-based mutual information estimator. Our simulation results provide examples of codes with positive secrecy rates that outperform the best known achievable secrecy rates obtained non-constructively for the Gaussian wiretap channel. Additionally, we show that our code design is suitable for the compound and arbitrarily varying Gaussian wiretap channels, for which the channel statistics are not perfectly known but only known to belong to a pre-specified uncertainty set. These models capture not only uncertainty related to channel statistics estimation, but also scenarios where the eavesdropper jams the legitimate transmission or influences its own channel statistics by changing its location.
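The leakage above is a mutual information; while the paper estimates it with a neural estimator for continuous observations, the quantity itself can be illustrated with a simple plug-in estimate over discrete samples (hypothetical toy data below):

```python
from collections import Counter
import math

def mutual_information(pairs):
    """Plug-in estimate of I(X; Z) in bits from joint samples (x, z).

    A discrete histogram illustration of the leakage quantity only; the
    paper measures I(message; eavesdropper observations) with a
    neural-network-based estimator on continuous channel outputs.
    """
    n = len(pairs)
    joint = Counter(pairs)
    px = Counter(x for x, _ in pairs)
    pz = Counter(z for _, z in pairs)
    mi = 0.0
    for (x, z), c in joint.items():
        pxz = c / n
        # p(x,z) log2( p(x,z) / (p(x) p(z)) ), with counts px[x]/n, pz[z]/n
        mi += pxz * math.log2(pxz * n * n / (px[x] * pz[z]))
    return mi

# Perfect secrecy: the eavesdropper's observation Z is independent of M.
independent = [(m, z) for m in (0, 1) for z in (0, 1)]
# Total leakage: the eavesdropper observes the message directly.
leaky = [(m, m) for m in (0, 1)] * 2
```

A secrecy-achieving code drives this quantity toward zero while keeping the legitimate receiver's error probability small, which is exactly the trade-off the decoupled autoencoder/hash construction targets.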
Predictor importance is a crucial part of data preprocessing pipelines in both classical and quantum machine learning (QML). This work presents the first study of its kind in which feature importance for QML models is explored and contrasted against their classical machine learning (CML) equivalents. We developed a hybrid quantum-classical architecture in which QML models are trained and feature importance values are calculated from classical algorithms on a real-world dataset. This architecture has been implemented on ESPN Fantasy Football data using the Qiskit statevector simulator and IBM quantum hardware such as the IBMQ Mumbai and IBMQ Montreal systems. Even though we are in the Noisy Intermediate-Scale Quantum (NISQ) era, the physical quantum computing results are promising. To accommodate current quantum scale, we created a data tiering, model aggregation, and novel validation methodology. Notably, the feature importance magnitudes from the quantum models exhibited much higher variation than those of the classical models. We show through diversity measurements that equivalent QML and CML models are complementary: the diversity between QML and CML indicates that both approaches can contribute to a solution in different ways. In this paper, we focus on the Quantum Support Vector Classifier (QSVC), the Variational Quantum Circuit (VQC), and their classical counterparts. The ESPN and IBM Fantasy Football Trade Assistant combines advanced statistical analysis with the natural language processing of Watson Discovery to provide fair, personalized trade recommendations. Here, player valuation data for each player has been considered, and this work can be extended to compute feature importance for other QML models such as Quantum Boltzmann machines.
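Model-agnostic feature importance of the kind being compared here can be computed classically regardless of whether the underlying model is quantum or classical. One common choice, shown below purely as a generic illustration (the abstract does not name the paper's exact algorithm), is permutation importance: shuffle one feature column and measure the resulting drop in accuracy.

```python
import random

def accuracy(model, X, y):
    """Fraction of rows the model labels correctly."""
    return sum(model(row) == label for row, label in zip(X, y)) / len(y)

def permutation_importance(model, X, y, n_repeats=10, seed=0):
    """Mean accuracy drop when each feature column is shuffled in turn.

    Works for any callable `model`, including a wrapper around a trained
    quantum classifier, since only predictions are needed.
    """
    rng = random.Random(seed)
    base = accuracy(model, X, y)
    importances = []
    for j in range(len(X[0])):
        drops = []
        for _ in range(n_repeats):
            col = [row[j] for row in X]
            rng.shuffle(col)
            Xp = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, col)]
            drops.append(base - accuracy(model, Xp, y))
        importances.append(sum(drops) / n_repeats)
    return importances

# Toy stand-in model: predicts from feature 0 only, ignores feature 1,
# so feature 1 should receive importance exactly 0.
model = lambda row: int(row[0] > 0.5)
X = [[0.9, 0.1], [0.2, 0.8], [0.7, 0.3], [0.1, 0.9]]
y = [1, 0, 1, 0]
imps = permutation_importance(model, X, y)
```

Because the importance values depend on random shuffles and on the model's decision surface, repeated runs of such estimators naturally exhibit variance, which is one plausible lens on the higher variation observed for the quantum models.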
Kernel selection plays a central role in determining the performance of Gaussian process (GP) models, as the chosen kernel determines both the inductive biases and the prior support of functions under the GP prior. This work addresses the challenge of constructing custom kernel functions for high-dimensional GP regression models. Drawing inspiration from recent progress in deep learning, we introduce a novel approach named KITT: Kernel Identification Through Transformers. KITT exploits a transformer-based architecture to generate kernel recommendations in under 0.1 seconds, which is several orders of magnitude faster than conventional kernel search algorithms. We train our model using synthetic data generated from priors over a vocabulary of known kernels. By exploiting the nature of the self-attention mechanism, KITT is able to process datasets with inputs of arbitrary dimension. We demonstrate that kernels chosen by KITT yield strong performance over a diverse collection of regression benchmarks.